Knowledge-Grounded Target Group Language Recognition in Hate Speech

نویسندگان

چکیده

Hate speech comes in different forms depending on the communities targeted, often based factors like gender, sexuality, race, or religion. Detecting it online is challenging because existing systems are not accounting for diversity of hate identity target and may be biased towards certain groups, leading to inaccurate results. Current language models perform well identifying communities, but only provide a probability that text contains references particular group. This lack transparency problematic these learn biases from data annotated by individuals who familiar with To improve detection, particularly group identification, we propose new hybrid approach incorporates explicit knowledge about used specific groups. We leverage Knowledge Graph (KG) adapt it, considering an appropriate level abstraction, recognise speech-language related gender sexual orientation. A thorough quantitative qualitative evaluation demonstrates our as effective state-of-the-art while adjusting better domain changes. By grounding task knowledge, can contextualise results generated proposed groups most frequently impacted technologies. Semantic enrichment helps us examine model outcomes training detection systems, handle ambiguous cases human annotations more effectively. Overall, infusing semantic crucial enhancing understanding behaviors addressing derived data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Grounded Language Modeling for Automatic Speech Recognition of Sports Video

Grounded language models represent the relationship between words and the non-linguistic context in which they are said. This paper describes how they are learned from large corpora of unlabeled video, and are applied to the task of automatic speech recognition of sports video. Results show that grounded language models improve perplexity and word error rate over text based language models, and...

متن کامل

Using Natural-Language Knowledge Sources in Speech Recognition

At the current state of the art, high-accuracy speech recognition with moderate to large vocabularies (hundreds to tens of thousands of words) requires a language model. That is, in addition to incorporating a model of the correspondence between acoustic signals and words, a recognition system needs to incorporate a model of what sequences of words are possible, or at least likely. Typically, t...

متن کامل

Hate Me, Hate Me Not: Hate Speech Detection on Facebook

While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical viol...

متن کامل

Automated Hate Speech Detection and the Problem of Offensive Language

A key challenge for automatic hate-speech detection on social media is the separation of hate speech from other instances of offensive language. Lexical detection methods tend to have low precision because they classify all messages containing particular terms as hate speech and previous work using supervised learning has failed to distinguish between the two categories. We used a crowd-sourced...

متن کامل

A Survey on Hate Speech Detection using Natural Language Processing

This paper presents a survey on hate speech detection. Given the steadily growing body of social media content, the amount of online hate speech is also increasing. Due to the massive scale of the web, methods that automatically detect hate speech are required. Our survey describes key areas that have been explored to automatically recognize these types of utterances using natural language proc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Studies on the semantic web

سال: 2023

ISSN: ['1868-1158']

DOI: https://doi.org/10.3233/ssw230002